Speech LSI 
Products 
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OKI looks back on 113 years of history and experience in producing 
electric and electronic quality products of repute. OKI established its 
Electronic Devices Group in 1961, and has been designing and manufacturing 
high quality LSIs such as memories, microcomputers, telecom LSIs, a 
large variety of special purpose circuits and...ADPCM speech chips, ever 
since. 


OKI’s_ speech LSIs_ enjoy remarkable 
popularity world-wide. Their superior speech 
quality, as aresult of the company’s refined 
algorithm, has assured OKI a leading position 
among suppliers of speech chips. 


OKI speech circuits are easy to apply. Their 
internal circuit design allows a system based on 
an OKI speech chip to be operated as simply 
as a conventional tape recorder. Some circuits 
are designed for straightforward connection to 
a CPU or existing digital systems. 


Speech LSIs are installed in many familiar 
commodities used in everyday life, such as 
answering machines, clocks, cameras, toys, 
_ telephones, greeting cards, office automation 
equipment, alarm systems, clinical facilities, 
musical instruments, etc. The range of 
applications is steadily expanding, and now also includes automotive use 
and the so-called brown and white goods. The facility of speech will in 
future be incorporated in an ever-increasing number of applications. 


The new products introduced herein continue the tradition of the present 
range by placing unprecedented performance in the hands of the designer 
at an affordable price. OKI intends to continue to enhance a comprehensive 
product family aiming at ever increased performance and integration. 


Adagio 


OKI’s ADPCM recorder devices are tailored for high demands in 
speech quality at equally high economy. With the new ADPCM 
voice processor MSM6688 and a Sub Band Coding processor 
MSM6789 this high speech quality can be continued for more 
than an hour recording time; this aspect is particularly interesting 
for telephone answering machines. 


Chip form supply for selected products and small size packages 
save board space and allow full surface mount implementation 
thanks to OKI’s advanced IC packaging variety. 


NOTICE: 

PC-AT is a registered trademark of (nternational Business Machines Inc. 
Textool is a registered trademark of 3M Corporation. 

MS-DOS is a registered trademark of Microsoft Corporation. 


The Concept 


ADPCM differs from other methods in that it does not 
truly synthesise speech. It initially involves digitising, 
compressing and storing actual analog sounds (digital 
recording). In order to re-create the original sound, the 
compressed digital word is expanded back to its original 
size, converted into an analog signal, amplified, and 
played through a speaker. 

In fact, any kind of sound...voice, music, sound 
effects...etc., can be digitised, stored, and re-created 
faithfully with a high degree of naturalness. The idea 
behind this concept is to reduce the effective data rate in 
order to use memory economically and to minimize the 
data amount when transmitting voice data, while 
maintaining sufficientredundancy for faithful reproduction 
accuracy. In other words, maximum intelligibility and 
naturalness for all applications. 


Common Features 


e ADPCM and SBC algorithms for coding and decoding, 

¢ Suitable for speech and sound effects, 

¢ High speech intelligibility and sound naturalness, 

¢ Wide range of sampling frequencies, 

¢ Full telephone bandwidth and more, 

¢ Comprehensive functionality, 

¢ Easy to apply with only a few external parts, 

° High design flexibility, 

¢ Many analog extras on-chip, 

¢ Reasonably priced, 

¢ High device quality and reliability, 

¢ Through-hole & SMD packages, 

¢ Low power CMOS process technology. 

¢ Typical applications: 
Handicap aids, Medical systems, Teaching aids, 
Answering machines, Telecommunication, 
Industrial equipment, Computer systems, 
Consumer goods, Toys...and more. 
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For External Memory 


MSM5218 - Proven Track Record 


This LSI enjoys world-wide popularity thanks to its real-time 
analysis/synthesis capability and high versatility inuse. Theinexpensive 
MSM5218 gives the user nearly unlimited design flexibility, External 
conversion accuracies from 8 to 12 bits and a wide range of adjustable 
sampling frequencies allow economical designs with optimized voice 
quality. A built-in data overflow protection circuit minimizes distortions 
as aresult of excessive analog inputs during recording. MSMS5218 can 
be added to any existing digital system. 


MSMS5205 - The Playback Partner 


An unparalleled low-cost playback-only version of the MSM5218 
housing the same desirable features for ADPCM reproductions from 
external ROM. Voice reproduction is implemented via a 10-bit D/A 
converter. MSM5205 can be added to enrich any digital system with 
voice output. 


MSM6585 - The Big Brother 


Functionally identical to MSM5205, this device comes with a 12- 
bit D/A converter plus internal low-pass filter. Apart from saving parts 
count, MSM6585 offers superior reproduction fidelity due to the higher 
accuracy of its D/A converter. In order to upgrade existing MSM5205 
boards with MSM6585, the pin assignmenthas been largely maintained. 
Provided a MSM5205 board is operated at 4-bit ADPCM data, the 
devices can be exchanged in the socket with only the clock resonator 
replaced. Owing to the high reproduction accuracy, MSM6585 is 
recommended for CD-ROM applications, such as car navigation 
equipment. 


MSM6388 - 12 Bit Integrated Power 


The main attractions are undoubtedly the internal 12-bit ADC and 
12-bit DAC which guarantee a significant gain in speech quality. 
The maximum J/O time of 262 seconds at 4KHz sampling is 
achieved by means of the interface that controls four voice register 
LSIs, such as the OKI MSM6389A. MSM6388 is pin-switchable 
between stand-alone and MCU interface modes. Digital and analog 
circuits are combined on the same chip, including two OpAmps and 
a low-pass filter. The master clock is 4.096 MHz typically allowing 
a sampling range from 3.5 to 9.1 KHz. The sampling frequencies can 
be increased proportionally if a higher clock frequency is selected. The 
available memory capacity can be split into 1 to 8 channels in the 
stand-alone mode. Operation control in the MCU mode allows more 
than 8 channels and is implemented over a 4-bit control bus, by which 
the user accesses numerous registers to control start, stop, record, 
play, start-stop addresses, sampling frequencies, etc. In total, 16 
powerful instructions are built in for high application convenience. 
MSM6388 is particularly suitable for answering machines with up to 
5 minutes recording time. 
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For External Memory 


MSM6588 - The Professional 


With basically the same features as MSM6388, the MSM6588 
additionally provides voice triggered starting of the record function, 
pause mode anda simplified, but equally powerful set of 13 commands 
with 4 times faster processing speed in the MCU mode (4-Megabit data 
download in 16 sec.) Another extra for the stand-alone mode is the 
possibility to define 1 to 8 recording channels with fixed or flexible 
length. A hardware reset input has also been included. MSM6588 
operates at 3-bit ADPCM in the stand-alone mode and at 3 or 4 bit 
selectable in the MCU mode. 


NEW! MSM6688 - The Latest Generation 


MSM6688 is essentially based on MSM6588, but incorporates 
additional features designed for long-time answering machines. It 
addresses up to 32-Megabit serial registers (4 units MSM6685, 8- 
Megabit serial register) resulting in a considerable recording time of 35 
minutes at 16kbps. Like MSM6388 and MSM6588, it provides both 
MCU and stand-alone operating modes in both supporting 3-bit or 4- 
bit ADPCM data. Its stand-alone mode provides 63 recording channels 
administered by a data header on top of the interfaced register bank. In 
the MCU mode, a powerful instruction set of 14 commands makes full 
use of the speech processor’s capabilies, including a preset of recording 
time length. Voice trigger, channel erase, various recording modes, 
speech/address data transfers and a hardware reset are useful extras in 
both modes. Serial voice registers (volatile) and serial voice ROMs 
(non-volatile) can be combined for incoming and outgoing messages 
from the same bank. Thanks to a separate data input, up to 4Mbit voice 
ROM can be added to the maximum register capacity of 32Mbit. 
MSM6688 is under development. 


Economical Voice Registers and Voice ROMs 


Serial registers are designed to interface with speech processors to 
store compressed speech data. Four registers are addressed serially by 
the voice LSIs. The capacity for recording and playback can be 
expanded significantly by switching multiples of four voice registers. 
The register access cycle timeis 2.5 to4us while the current consumption 
is as low as to allow battery backup for extended periods. Up to 1- 
Megabit registers are for MSM6388 and MSM6588 while the 4-and 8- 
Megabit registers for MSM6688 and MSM6788 are under development. 


Serial voice ROMs are used in conjunction with serial registers for 
the storage of non-volatile outgoing messages such as time stamps or 
voice prompts. They are available in SMD packages with 1, 2 or 3- 
Megabit capacities, while the latter two are internally split into one 
Megabit banks switched by separate select inputs. 


Serial registers and ROMs combine the advantages of DRAMs 
(high capacity) and SRAMs (low current) in smaller packages. 
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Sree : Memory Bank Configurations. Notes 


prama7 1 X 4-Megabit + 3 x 1-Megabit DRAM U7 is for fixed outgoing messages 
pram#s 8X 1-Megabit DRAM and can be discarded if not needed. 
8 x 4-Megabit DRAM 
2 x 16-Megabit DRAM The same schematic applies to the 
ADPCM recorder MSM6688. 


Advance Info: MSM9880 


DSP Recorder MSM9880 


Coring a digital signal processor, DSP, with a proprietary 
compression algorithm, OKI Electric will introduce a speech recorder 
LSI, MSM9880, designed for fully static telephone answering machines. 
With a typical bit-rate of 4.8kbps approximately 14 minutes recording 
can be realised in conjunction with a4 Megabit dynamic RAM. With 
the silence deletion function activated even longer recording times will 
be achieved, since the average bit-rate lowers to approximately 3.5kbps. 


MSM9880 is controlled via a parallel CPU interface and provides 
a powerful set of software commands. Playback can be done in three 
different ways, at normal speed as recorded, at higher speed and at 
lower speed. When the speed changes, the pitch is not affected. Higher 
speed allows faster listening to the messages, while slower speed 
allows for instance typewriting during playback. 


In an event in which the calling party cuts the connection, the 
resulting busy tone will automatically be erased from the recording 
area. ; 


The major functional blocks include a 12-bit linear Codec (8 kHz 
sampling), low-pass filter, DIMF generator and receiver for remote 
access, a DRAM and serial register interface with direct addressing 
capability and a CPU parallel interface. Up to 32 Megabit memory can 
be connected allowing the recording of up to 200 phrases (messages) 
or less. 


MSM98380 is clocked by a 12.8MHz crystal during operation and 
switches automatically to RC oscillation as the chip enters the power 
down mode. During power down, the refresh cycle of the memories is 
extended to achieve a standby current of no more than 200pA. During 
operation, the maximum power consumption is expected to be 200mW 
typically at +5V single supply. 


Tentative Features 


- Internal 12-bit linear A/D and D/A 

- Internal low-pass filter 

- Average bit-rate without silence erasure 4.8kbps 

- Extended recording time with silence erasure at 3.5kbps 

- Parallel control interface for 4 or 8-bit CPUs 

- Operation control by CPU commands 

- DRAM/ARAM interface 1 Megabit x 4, 8 units max. 

- Bit failure scanning for 4Mb ARAMs upon power-on in 0.8 seconds 
- Serial Voice ROM interface 1/2/3 Meg units, 4 Meg max. 
- Recording time 14 minutes (4.8kbps, 4 Meg DRAM/ARAM) 
- Maximum 200 phrases provided for messages 

- Programmable voice level triggered recording 

- Normal, fast and slow playback without affecting pitch 

- Automatic busy tone deletion 

- Built-in DTMF generator and receiver 

- Power supply single +5V 

- Operating current 40mA max. 

- Operating temperature -10 to +70°C 

- Power-down current 200A max. 

- Master clock 12.8MHz typ. 

- 100-pin QFP or 128-pin QFP 


Specification may be changed without prior notification. 
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With Internal ROM 


MSM6375 - The Popular Family 


MSM6375 is a maskable speech synthesizer with 1 Megabit 
internal ROM permitting a reproduction time of 64 seconds at 4KHz 
sampling. Upto 111 phrase addresses can be set by switches or by 
a low-cost MCU to compile individual phrases into proper sentences. 
Special features include playback through two channels for echo or 


BGM generation. A signal attenuation of up to -12dB in three steps can _ 


be realised. What is more, a beep tone of various frequencies can be 
generated for selectable durations. The typical sampling frequencies 
are4, 6.4, and 8KHz which can be extended up to 32KHz by variation 
of the clock frequency. MSM6375 is equipped with a 12-bit D/A 
converter and with aninternal low-pass filter for maximum reproduction 
quality. It operates between -40 to +85°C, thus permitting industrial 
applications. Several masking options permit individual matching with 
given applicationrequirements. Thanks to their low power consumption, 
these devices are battery operable. The ROM synthesizer is available 
as a set of five LSIs, which differ only in the ROM size: 


Maximum playback times are based on 4 kHz sampling frequency. 


Speech code masking is performed in the factory based on the 
customer s EPROM, whichis programmed by means of OKIs dedicated 
PC-based development kit, AR76-202, and verified using the evaluation 
board 6376. Alternatively, a user-programmable OTP-version is 
available for quick prototype evaluation and for low to medium size 
productions. MSM63P74 comes in a 20-pin DIP providing two more 
pins than ROM devices. These are used for programming and are 
located at positions 1 and 20. The lower pins are identical to the ROM 


versions. 


MSM6376 - The Allrounder 


Comprising the same functions as the ROM synthesizers, this 
MSM6376 omits the internal memory to favor off-chip solutions for 
voice code storage. This capability covers an addressing range of up to 
16 Megabits external (EP)ROM which provides a total output time of 
17 minutes at 4 kHz sampling frequency. Designed to emulate the ROM 
synthesizers, every available mask bonding option can be simulated by 
pin selections, including beep tone, 2-channel playback, echo generation, 
MCU or switch mode, oscillator type, etc. This device is an ideal 
solution particularly for quantities which would not justify a mask. It 
enables phrases to be compiled into sentences, and also permits 
playback in the form of a single long phrase. 

MSM6376 has prevailed for multi-language applications by means 
of message reproduction from separately selected EPROMs. 


MSM6374 WITH READY-TO-USE CLOCK CODE ("007") 


MSM6375 
FAMILY 
ROM-CHIPS 


In addition to custom mask devices three standard speech codes are 
available for "talking clock" applications. All parts are identical to the 
custom mask devices. Speech phrases are compiled by an MCU in 
accordance with the individual address table. The speech codes are in 
the English language with different recording parameters as follows: 


MSM6374-007 
Female voice sampled at 6.4KHz for time announcement. 


MSM6374-544 
Female voice sampled at 6.4KHz for time and day-of-the-week 
announcements. 


MSM6373-329 
Male voice sampled at 4.0KHz for time and day-of-the-week 
announcements. 


NN SSS 
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With Internal ROM 


TYPICAL APPLICATION WITH CONTROLLER MODE MSM6650 - The New Family 


The MSM6650 family consists of five ROM versions, one OTP 
version and one version for external memory. They are based on OKI 
Electric’s successful MSM6375 family and feature upgraded functions 
and performance. Depending on mask selection, the LSIs operate on a 
stand-alone basis or controlled by an MCU. In the stand alone mode, 
speech is reproduced immediately upon address selection with no 
more start pulse required. In the MCU mode, two basic means of 
control are provided, a parallel bus or a serial bus, either of which is 
selected by mask option. If the serial interface is selected, two MCU 
I/Os are added by the synthesiser. The quality of the speech reproduction 
is based on the sampling frequency, which can be selected from a range 
of 4 KHz to 32KHz. Alternatively, straight 8-bit PCM coding can be 
used. Moreover, each phrase may be sampled at different sampling 
frequencies or different data formats in order to achieve maximum 
memory economy whilst obtaining optimum speech quality. Besides 
the ability to form sentences from 127 different phrases, the following 


TYPICAL APPLICATION WITH STAND-ALONE additional features have been implemented: 


¢ Melody generation, 

* Two-channel or echo reproduction, 
¢ Level attenuation, 

¢ Fade out function, 

¢ Four different beep tone frequencies 
¢ Random playback of phrases. 


The melody function is realised by means of a compression that 
takes advantage of data repititions and results in an average bit-rate of 
4 to 5 kbps. Maximum voice quality is achieved by the internal 12-bit 
D/A converter and 4th order low-pass filter. What is more, the LSIs 
come in low pin-count packages or in chip form and offer a new 
approach to typical applications such as telephones, facsimile 
equipment, household appliances, alarm systems, AV, computers, 
measuring equipment, watches, thermometers and, of course, games 
and toys. For MSM6650 (ROM-less version), the voice EPROM can 
be used directly in the application circuit, while up to a maximum of 
64Mbit external memory (typically EPROM) can be addressed. 


Product Line-up 


Maximum playback times are based on ADPCM 
at 4 kHz sampling frequency (ADPCM bit-rate 16kbps). 
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Specification Overview 


Active Current (max. 
Operating Temperature -30~+70°C -30~+70°C | -40~+85°C 
Packaging 24-DIP 18-DIP 18-DIP 44-QFP 

MCU I/F 


Notes Upgrade of SA/MCU 
Voc trigger DRAM I/F 


MSM5205 Register |/F 
Registers refer to serially addressable voice registers, such as MSM6389, MSM6586, MSM6587 (volatile) and MSM6595, MSM6596, MSM6597 (non-volatile). 
Typical output times are given for 4 KHz sampling frequency and with the chip’s own addressing capability without expansions. 
* Data is preliminary as under development 


no I/F LSI 


Register I/F 


Typ. Output Time (sec) 
Number of Channels 


16-DIP or CHIP 18-DIP 20-DIP 18-DIP 18-DIP 64-QFP 
24-SOP — 24-SOP 24-SOP 64-SDIP 
Chip Chip Chip 
Echo Beep tone Beep tone MSM6374 Beep tone Beep tone MSM637X 


Typical output times are given for 4 KHz sampling frequency and with the chip’s own addressing capability without expansions. 


LEGEND: 

Register: Serial Voice Register and Serial Voice ROM 
Recorder: ADPCM analysis function (recording and playback) 
Player: ADPCM synthesis function (playback only) 
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Specification Overview 


| msmess2_| msmecss | wsmess4_| 
PCM Data Length 
Oso. Frequency (H2) 
Sampling Rate (KH2) 
| 64M ROM _| 
ee eed 
a Ae | 


msmesPs4 | msmesss | msmesse | msmesso_| 


PARAMETER 


Function 
ADPCM Data Length 


ee 
S 
oad 


extemal Mempary (bie es [ss ee 


Lee tars | 7 
Internal Low Pass Filter 40dB/oct. -40dB/oct. -40dB/oct. -40dB/oct. 40dB/oct -40dB/oct. -40dB/oct. 


if 

Supply Voltage 12.7-5.5V 12.7-5.5V 

Active Current (max. ismA 

Operating Temperature “A0~+85°O 
24-SOP 24-SOP 24-SOP 24-SOP 24-SOP 24-SOP 64-SDIP 

Notes 2-Ch/Echo 2-Ch/Echo 2-Ch/Echo 2-Ch/Echo 2-Ch/Echo 2-Ch/Echo EVA 

Melody/Beep | Melody/Beep Melody/Beep | Melody/Beep | Melody/Beep | Melody/Beep CHIP 


27 127 


Fade-out Fade-out Fade-out Fade-out Fade-out Fade-out 


PARAMETER - MSM6586 MSM6389A | msmesss | msmesos | 
Function Serial Reg Serial Reg 
262.144x1 1.048.576x1 


Organisation 
Capacity (bit) 
Address Units 


A 
100A 100yA 100yA 10yuA 10yA 10uA 
0~+70°C 0~+70°C 0~+70°C -40~+85°C -40~+85°C -40~+85°C 


Serial Access Time 
Serial Read/Write Time 
Operating Voltage 
Max. Active Current 


Max. Standby Current 


Operating Temperature 


5m 
Packaging 18-QFJ 18-QFJ 18-QFJ 18-QFJ 18-QFJ 24-SOP 
16-DIP 24-SOP 24-SOP 
One unit with | One unit with | 4 units with Only with Only with Only with 
6388/6588 6388/6588 6388/6588 Register(s) Register(s) Register(s) 


OKI ELECTRIC EUROPE GmbH Page 11 


Notes 


Special Purpose 


MSM6378A/6379 - The Programmable OTPs 


MSM6378A is a voice synthesizer including 256Kbit OTP (one- 
time programmable ROM), while MSM6379 is the equivalent with 
512Kbit ROM. Also internally comprising a 12-bit DAC and a 
buffered low-pass filter, the devices merely requires an external 
amplifier, a speaker and an RC network to be fully operable. For 
programming the internal OTP, OKI offers the ANAWRITER which 
is a tool designed for recording an analog source (without editing), 
conversion to ADPCM data and programming the device. Alternatively, 
the PC-based tool AR76-202 can be used with all editing possibilities. 
The vocabulary is stored in the OTP as one single phrase, which can be 
reproduced once or by means of endless-loop playback. The sampling 
frequencies are derived from the oscillation frequency of 64 to 256KHz 
through division by 16. Consequently, sampling frequencies from 4 to 
16KHz are practicable, resulting in a maximum playback time of up to 
32 seconds at 4 KHz sampling. Thanks to the one chip solution, 
MSM6378A and MSM6379 suit a wide range of applications, including 
personal voice card and other miniature devices designed to produce 
high quality speech. 


MSM6295 - The Musician 


Profiting from the high audio quality of the ADPCM algorithm, 
this LSI manages the mixing of four reproduction channels with 
individual controlling facilities related to level attenuation and channel 
selection. There are two types which interface with common CPUs. 
MSM6295 addresses 2 Megabitexternal (EP)ROM, while MSM6295V 
is designed for 1 Megabit (EP)ROM. The memory contains both the 
sound/speech phrases and the corresponding addresses of the individual 
phrase locations. Analog portions shall be designed externally to 
permit maximum design flexibility. High sampling rates up to 32KHz 
would permit applications such as musical instruments, rhythm 
generators, BGM, echo generation, etc. 


MSM6310 - The Continuous Recorder 


Unlike the other OKI speech circuits, MSM6310 adopts straight 
Pulse Code Modulation (PCM) to code and decode analog inputs. It 
is capable of both audioinput (recording) and output (playback). Once 
started, it records any audio input for a specified time, and then 
immediately continues to re-record the external memory with the 
continuing input, while the previous recording is erased at that 
instant. In other words, the memory consistently contains the most 
recent recording which can be retrieved any time by activating the 
playback function. An ideal solution for dictation machines or language 
trainers. Essentially, the chip consists of an 8-bit ADC and DAC, pre- 
amplifier, low-pass filter and amemory control unitincluding refresh 
generation. Memory is connected externally, 1 or 2 units of 256k x 1 
bit DRAM resulting in a maximum loop time of 8 or 16 seconds. 
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Special Purpose 


NOT JUST Boe MSM6722 z MSM06722 - The Voice Changer 


MSM6722 performs the pitch control of analog signals, i.e., voice. 
The pitch of an incoming signal can be shifted in real-time within the 
upper and lower octave relative to the base pitch level of the original 
input. What this device does in practice is to change the input voice to 
produce, for instance, a hoarse mickey mouse voice or similar. The 
input is sampled at 8KHz while the DAC varies the sampling rate 
between 4 and 16KHz in real-time. There are two different pin 
selectable control modes, one is the 17-stage up/down push button 
control, the other mode uses a four bit binary control interface. 

Internally, the device comprises OpAmps, input low-pass filter, an 
8-bit ADC, data processing unit (incl. 1 kilobitRAM), a9-bit DAC and 
a buffered output low-pass filter, all of which represent the signal path 
from input to output. MSM6722 comes in a 24-pin small outline 
package (SOP) and can be used as voice changer in telephones to 
discourage unwanted callers. 

THE MINI POWER PACK: MSC1191 


MSC1191/MSC1157 - The Small Power Packs 


MSC1191 has been developed for use with OKI’s Synthesizer LSIs 
which can be operated at low supply voltages. This Bic CMOS amplifier 
circuit essentially comprises two operational amplifiers, a bias circuit 
and a stand-by detector provision. It can be connected directly to a 
speaker without a coupling capacitor to achieve an audio power of 
0.3W maximum at8 . A flexible power supply range from +2V to+6V 
and low current requirement of 1.5mA typically are essential 
characteristics for battery operation. The amplifier is available in chip 
form or packaged in a standard 8-pin DIP or an 8-pin SOP. In addition 
to MSC1191, the new MSC1157 provides adjustable gain. 


MSM6588 AND A DYNAMIC RAM WITH INTERFACE LSIMSM6691 {JS M6691: The Interface For DRAMs 


<< 


MSM6691 is an interface LSI that is especially designed for 
MSM6388 and MSM6588 and converts addresses from serial to 
parallel and vice-versa. By this means, the voice processor can be used 
in conjunction with standard dynamic RAM upto 4 Megabit units. This 
can be used in conjunction with answering machines for short to 
medium recording times. A refresh generator is internally implemented 
providing “RAS-only” refresh. As for dynamic RAM, the organisations 
4-Megabit x 1 or 1-Megabit x 1 are supported (fast page mode or nibble 
mode, not static column mode). A memory bank may consist of 1 to 4 
units resulting in a maximum voice data area of 4 or 16-Megabit. With 
4-Megabit DRAMs, MSM6691 allows the selection of one of four 
DRAMs by activation and deactivation of chip select inputs by means 
of the inputs “BANKO” and “BANK1”. Otherwise, the circuit works 
exactly as if registers were used, including programmable channels 
and all MCU control features. The full configuration with 4-Megabit 
devices provides the following recording times for answering machines: 


17 min. 29 sec. at 16Kbps 
11 min. 39 sec. at 24Kbps 
8 min. 44 sec. at 32Kbps 


a 
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The ADPCM Algorithm 


ADPCM, Adaptive Differential Pulse Code Modulation, differs 
from other methods in that it does not truly synthesize speech. It 
initially involves digitizing, compressing and storing actual analog 
sounds (digital recording). To re-create the original sound, the 
compressed digital word is expanded back to its original size, converted 
into an analog signal, amplified, and played through a speaker. 

In fact, any kind of sound...voice, music, sound effects...etc., can be 
digitized, stored, and re-created faithfully with a high degree of 
naturalness. The idea behind this concept is to reduce the effective data 
rate in order touse memory efficiently and to minimize the data amount 
when transmitting voice data, while maintaining sufficient redundancy 
for faithful reproduction accuracy. In other words, maximum 
intelligibility and naturalness for all applications. 

ADPCM represents an improvement with respect to conventional 
techniques in that it adaptively changes the quantizer step size (scale 
factor) to suit the waveform being 
encoded. 

The drawing shows the block 
diagram of a typical ADPCM encoder. 
Input audio is filtered and then digitized 
by an A/D converter. This PCM input 
data, Xn, is compared with a PCM signal 
estimate, “Xn-1, calculated from the 
previous sample Xn-1. The resulting 
differential value, dn, isencoded relative 
to the current step size, (An), and output 
as an ADPCM data sample, Ln. The 
ADPCM data is also fed to the step size 
determination logic and the decoder. 

The ENCODER accepts the 
differential value, dn, from the 
comparator and the step size, An, and 
calculates a 4-bit ADPCM code. The 
DECODING logic transforms the output 
ADPCM data, Ln, back to a differential 
value, using the same step size data as 
the encoder. These differential values 
are accumulated to form an estimated 
(reproduced) waveform value, “X. The 
value for the previous sample is fed back to the comparator as ‘Xn-1. 
This feedback loop allows the system to recover ffom inputs that 
temporarily overload the encoder. 

The DECODER accepts ADPCM code values, L, and step size 
values, A, calculates a differential value, g, and accumulates an 
estimated waveform value, X. 

When the ADPCM device is reset, the step size, An, is set to 
minimum and the estimated waveform value, ‘X, is set to zero (half 
scale). The system is almost identical to a typical DPCM coder except 
that additional logic provides adaption of quantizer step size on the 
basis of the most recent ADPCM code output. 


Unlike true synthesizers, ADPCM is a sophisticated representative of 
waveform coders offering 4-bit accuracy. One bit, the MSB, is used 
as a sign bit, which denotes the direction of the original waveform, 
ascending or descending. Consequently, nearly any sound and any 
speech irrespective of the language can be processed and reproduced 
with surprising quality. 

In order to achieve telephone bandwidth, 8 kHz sampling is the 
tight choice, as the filter cut-off should be fixed on about( fs/2)*0.85. 

For recorder devices, it is recommended that the input amplitude of 
the analog source signal be limited to about 80% of the dynamic range 
of the internal or external A/D converter. 

It is recommended that the components for external output filters 
and amplifiers be carefully selected. An incorrect choice would impair 
the original quality the speech chips are designed to produce. This 
consideration equally includes the careful separation of analog and 


digital lines, the grounding of analog lines at both ends and further 
adequate separation from highspeed digital circuits to avoid distortions 
thereof. 

Maximum reproduction quality can be achieved with devices 
incorporating a 12-bit A/D converter and 12-bit D/A converter offering 
the fine resolution of 4096 vertical steps. An 8-bit device provides 256 
steps, which makes a noticable difference in sound perception. The 
time for which a chip produces speech is determined by the available 
memory capacity and the bit-rate. The latter is composed of the product 
of the sampling frequency and the number of bits per sample (4 bits). 
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Programming Example 


Sample Routine for MSM80C154 
Interfaced With MSM6388 


A typical sample program shows the amount of programming 
effort for essential control tasks. 

The assembler routine written for an Intel-compatible MSM80C154 
demonstrates the functions of MSM6388. "EXEC" is a subroutine 
which sends a command to MSM6388 and sends or receives the 
corresponding data to/from the recorder chip. The complete command 
is passed to the subroutine in the internal RAM starting from the 
address "COMND" which mustbe bit addressable. The address COMND 
contains the MSM6388 command in the low order four bits. The high 
order nibble must contain all zeros. Successive addresses contain the 
corresponding data nibbles for certain commands in the low order four 
bits. The commands DTRD, DTWR, EREC and EPLAY are not 
supported by this example routine. 

The command table, status register and hardware configuration are 
given for reference on this page. Most of the commands are one-nibble 
instructions which are executed upon transmission. SAMP selects the 
sampling frequency while CHAN determines one of 8 channels. Both 
commands expect another data nibble to follow. Address read/write 
operation requires maximum three subsequent more nibbles containing 
the relevant addresses. 

More than 8 channels can be defined if the controller provides the 
start and stop addresses. In this case, channel 0 must be selected. Start 
and stop addresses are then written into the index area of channel 0 and 
fetched by MSM6388 prior to a recording or playback operation. 


MSM80C154 


MSM6388 


Though tailored for 
MSM6388, by knowledge 
of the instruction table of 
MSM6588 and MSM6688 
this routine is easily 
adaptable. For details, 
reference is made to the 
latest Voice LSI data book. 


THIS ROUTINE PASSES A 6388-COMMAND TO THE SUBROUTINE EXEC 


;RO=COMMAND DATA POINTER 


;R1=COMMAND NIBBLE COUNTER FOR SAMP, CHAN, STWR, SPWR, STAD, SPRD 
;ROUTINE DOES NOT SUPPORT COMMANDS DTRD, DTWR, EPLAY, EREC 


GETD: CE EQU P3.3 


EXEC 


INC 
JNB COMND.3, AC_ST 
MOV 


Ri, #4 

AC_ST MOV A, @RO 

JB AGC.O, $-1 

MOV A, @RO 

MOVX @RO0, A 

ING RO 

DJNZ Ri, AC_ST1 
AC_STAT : MOV A, COMND 

CJNE A, #4, AC_STOP 

MOVX A, @RO 

JB AGC. 0, $-1 

SJMP MA_RT 
AC_STOP : CJNE A, #5, MA_RT 

MOVX A, @RO 

JB ACG.1, $-1 

SJMP MA_RT 
AC_ST1 JB COMND. 2, AC_ST 

INC R1 
AC_AD MOV R2, #3 
AC_AD1 JB VCK, $ 

JNB VCK, $ 

DINZ R2, AC_AD1 

DJNZ Ri, AG_AD2 

SJMP MA_AT 
AC_AD2 JB COMND. 1, AC_AD3 

MOV A, @RO 

MOVxX @RO0.A 

SJMP AC_AT 
AC_AD3 MOVX A, @RO 

MOV @R0,A 
AC_RT ING RO 


MA_RAT : SETB CE 


;PORT 3.3 = CE 6388 

;PORT 3.5 = 2VCK 6388 
;ADDRESS IN INTERNAL RAM 806154 
;COMMAND DATA 


;EXAMPLE; SPWR 
32, NIBBLE 
33. NIBBLE 
‘4. NIBBLE 


{ENABLE 6388, CE=0 


;COMMAND INPUT 

GET COMMAND 
7141111010 + COMMAND, COMMAND<6? 
DATA COUNTER 

;SINGLE COMMAND 


JUMP IF SAMP, CHAN 

4 NIBBLE COMMANDS 
;LOAD DATA FROM 6388 TO ACC 
;BUSY CHECK, LOOP UNTIL BUSY=0 
{GET COMMAND 
;WRITE COMMAND 
{GET COMMAND 


;GET BUSY 
iBUSY CHECK, LOOP UNTIL BUSY=0 


{WAIT 3 VCK CYCLES 


}READY 


JUMP IF STRD, SPAD 


WAITE COMMAND 


iREAD COMMAND 


{DATA SET 


{DISABLE 6388 
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AR76-202 


In-House Speech Code Development 


...eliminates the need for expensive analysis charges 
and gives you the freedom to edit your codes 
individually whenever required. Moreover, the 
provision of the instantly coded EPROM enables the 
length of the production process to be reduced quite 
remarkably. 


Outline of AR76-202 

AR76-202 is fully PC supported. Included with the shipment 
is a standard 16-bit PC slot card, an EPROM programmer, a 
software driver and an English manual. 

The host environment should consist of a PC-AT® or 100% 
compatible machine equipped with a harddisk, VGA® graphic 
adaptor and serial mouse. A printer is useful but not essential. 
The driver software runs under DOS 3.2 and higher on a 640K 
machine. However, for speech phrases of more than 16 seconds, 
EMS expansions are strongly recommended in order to record 
long phrases in one piece without concatenations. A tape deck 
or cassette player is needed as an analog source which can be 
connected directly to the installed board as well as aloudspeaker. 


Together with AR76-202 a broad range of demonstration 
kits are available for selected speech LSIs. For in-socket 
evaluation of EPROM codes generated with your PC 
developmenttool for masked speech synthesizers, the evaluation 
board 6376 is the right choice as it simulates all functions of the 
target device, including mask bonding options. 


| Likewise, the external EPROM/OTP writer is linked to the 


START-UP SCREEN 


Performance 
What AR76-202 does for you comprises all essentially 
demanded functions from recording all the way up to EPROM 
programming, plus a few extras. In detail: 
¢ Recording into host memory through an audio line input 
¢ Playback of a recording from host memory via speaker 
¢ Editing of a recorded voice file, including 
- Amplitude manipulation 
- Silencing / Silence insertion 
- Fading / Cut / Copy / Paste 
¢ HEX-file or BIN-file generation 
¢e EPROM/OTP programming 
¢e EPROM duplication 
° Random access playback of speech files 
¢ Covers all OKI speech LSIs, including OTP synthesizers 
¢ Makes full use of Expanded Memory System, EMS 


Hardware 

Measuring approximately 284(L) x 107(W) mm, the slot 
card is designed around OKI Electric’s MSM5218 speech 
recorder LSI in connection with external 12-bit converters plus 
low pass filter featuring -48dB attenuation per octave. Both 
analog and digital circuits are provided through to analog inputs 
and outputs for direct connection of a tape deck and a speaker. 


card‘s interface. 


Software 

The software driver, "ONSAKU", is fully graphics oriented 
and operable with keys or preferably with a mouse. The primary 
application of the software is for editing raw speech recordings. 
This includes amplitude amplification and attenuation, cutting, 
copying and insertion of selected excerpts. Additionally, the 
auto-fade in/fade out function helps to save time. Using this 
software is like using a modern wordprocessor. A recorded 
waveform is displayed on the graphic screen in three scales 
allowing for precise editing at any point. 

"ONSAKU" can be started with a variety of option switches 
in order to adjust operating parameters for more convenience 
and easier further processing. 
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